05. MC Prediction - Part 2

## Quiz

To check your understanding of the video, please answer the question below.

Which of the following statements are true? (Select all that apply.)

SOLUTION:
  • If the agent follows a policy for many episodes, we can use the results to directly estimate the action-value function corresponding to the same policy.
  • The Q-table is used to estimate the action-value function.
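To make the two statements above concrete, here is a minimal sketch of first-visit Monte Carlo prediction for action values. It is an illustration under stated assumptions, not the course's reference implementation: the function name `mc_prediction_q`, the episode format (a list of `(state, action, reward)` tuples, where the reward is the one received after taking the action), and the use of a plain dictionary as the Q-table are all hypothetical choices made for this example.

```python
from collections import defaultdict


def mc_prediction_q(episodes, gamma=1.0):
    """Estimate Q(s, a) by first-visit Monte Carlo averaging.

    `episodes` is assumed to be a list of episodes generated by following
    one fixed policy; each episode is a list of (state, action, reward).
    """
    returns_sum = defaultdict(float)  # sum of sampled returns per (s, a)
    visit_count = defaultdict(int)    # number of episodes contributing per (s, a)

    for episode in episodes:
        # Compute the discounted return from every time step, working backwards.
        returns = [0.0] * len(episode)
        G = 0.0
        for t in range(len(episode) - 1, -1, -1):
            _, _, reward = episode[t]
            G = reward + gamma * G
            returns[t] = G

        # First-visit rule: only the first occurrence of each (s, a) pair
        # in an episode contributes to the average.
        seen = set()
        for t, (state, action, _) in enumerate(episode):
            if (state, action) in seen:
                continue
            seen.add((state, action))
            returns_sum[(state, action)] += returns[t]
            visit_count[(state, action)] += 1

    # The Q-table: average observed return for each visited (s, a) pair.
    return {sa: returns_sum[sa] / visit_count[sa] for sa in returns_sum}
```

For example, given the two hypothetical episodes `[("s0", "a", 1.0), ("s1", "b", 2.0)]` and `[("s0", "a", 0.0), ("s1", "b", 4.0)]` with `gamma=1.0`, the returned Q-table maps `("s0", "a")` to 3.5 and `("s1", "b")` to 3.0. The estimates describe only the policy that generated the episodes, and they become more reliable as the number of episodes grows.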